SEQUENCE LISTING 

<U0> Nilles, Matthew L. 
Matson, 3yl S. 

<120> YERSINIA SPECIES COMPOSITIONS 

<130> 3128-6046US 

<150> US 60/444,076 

<151> 2003-01-31 

<160> 20 

<170> Patentln version 3.2 

<210> 1 

<211> 87 

<212> PRT 

<213> Yersinia pestis 
<220> 

<223> amino acid sequence of YscF 

<223> sequence can be found at MedLine accession number NP_857921.1 



Met ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp lie Ala Asp Leu 
15 10 15 

Asp Ala val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys 
20 25 30 

Ala val Asn Asp Ser lie Ala Ala Leu Lys Asp Lys Pro Asp Asn Pro 
35 40 45 

Ala Leu Leu Ala Asp Leu Gin His Ser lie Asn Lys Trp Ser val He 
50 55 60 

Tyr Asn lie Asn Ser Thr He Val Arg Ser Met Lys Asp Leu Met Gin 
65 70 75 80 

Gly lie Leu Gin Lys Phe Pro 
85 

<210> 2 
<211> 264 
<212> DNA 

<213> Yersinia pestis 
<220> 

<223> nucliec acids encoding YscF 

<223> sequence can be found at MedLine accession number NLJ304839 



180 
240 



atgagtaact tctctggatt tacgaaagga accgatatcg cagacttaga tgcggtggct 60 
caaacgctca agaagccagc agacgatgca aacaaagcgg ttaatgactc gatagcagca 120 
ttgaaagata agcctgacaa cccggcgcta cttgctgact tacaacattc aattaataaa 
tggtcggtaa tttacaatat aaactcaacc atagttcgta gcatgaaaga cttaatgcaa 
ggcatcctac agaagttccc ataa 264 

<210> 3 

<211> 264 

<212> DNA 

<213> Yersinia enterocolitica 

<220> n ,. . 

<223> nucleic acid encoding YscF from Y. enterocolitica 

<223> sequence can be found at MedLine accession number NC_005017 
<400> 3 

atgagtaatt tctctgggtt tgcaaaagga accgatatca cagacttaga tgcggtggct 60 

caaacgctca agaagccagc agacggcgca aacaaagcgg ttaatgactc gatagcagcg 120 

ttgaaagaga cgcctgacaa cccggcgcta cttgctgact tacaacattc aattaataaa 180 

tggtcggtaa tttacaatat aagctcaacc atagttcgta gcatgaaaga cttaatgcaa 240 

ggcatcctac agaagttccc ataa 264 



<210> 4 

<211> 87 

<212> PRT 

<213> Yersinia enterocolitica 

<220> . 

<223> amino acid sequence of YscF from Y. enterocolitica 

<223> sequence can be found at MedLine accession number NP„863538.1 



Met Ser Asn Phe ser Gly Phe Ala Lys Gly Thr Asp He Thr Asp Leu 
15 10 15 

Asp Ala val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys 
20 25 30 

Ala val Asn Asp ser lie Ala Ala Leu Lys Asp Thr Pro Asp Asn Pro 
35 40 45 

Ala Leu Leu Ala Asp Leu Gin His ser lie Asn Lys Trp Ser val He 
50 55 60 
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Tyr Asn Met Ser ser Thr lie Val Arg Ser Met Lys Asp Leu Met Gin 
65 70 75 80 

Gly lie Leu Gin Lys Phe Pro 

<210> 5 

<211> 80 

<212> PRT 

<213> salmonella enterica 
<220> 

<223> amino acid sequence of Prgl 

<223> sequence can be found at MedLine accession number CAD05980.1 



Met Pro Thr Ser Trp Ser Gly Tyr Leu Asp Glu val ser Ala Lys Phe 
1 5 10 15 

asp Lys Gly val Asp Asn Leu Gin Thr Gin val Thr Glu Ala Leu Asp 
20 25 30 

Lys Leu Ala Ala Lys Pro Ser Asp Pro Ala Leu Leu Ala Ala Tyr Gin 
35 40 45 

ser Lys Leu Ser Glu Tyr Asn Leu Tyr Arg Asn Ala Gin Ser Asn Thr 
50 55 60 

val Lys Val Phe Lys Asp He Asp Ala Ala He He Gin Asn Phe Arg 
65 70 75 80 

<210> 6 
<211> 243 
<212> DNA 

<213> salmonella enterica 

<223> sequence can be found at MedLine accession number AL627276 
<220> 

<223> nucleic acid encoding Prgl 
<400> 6 

atgccaacat cttggtcagg ctatctggat gaagtttcag caaaatttga taagggcgtt 60 
gataatctac aaacgcaggt aacagaggcg ctggataaat tagcagcaaa accctccgat 120 
ccggcgctac tggcggcgta tcagagtaag ctctcggaat ataacttgta ccgtaacgcg 180 
caatcgaaca cggtaaaagt ctttaaggat attgatgctg ccattattca gaacttccgt 240 
taa 243 



3 



<210> 7 

<211> 83 

<212> PRT 

<213> Shigella flexneri 
<220> 

<223> amino acid sequence of MxiH 

<223> sequence can be found at MedLine accession number NP_858270.1 



Met ser Val Thr Val Pro Asn Asp Asp Trp Thr Leu Ser ser Leu ser 
15 10 15 

Glu Thr Phe Asp Asp Gly Thr Gin Thr Leu Gin Gly Glu Leu Thr Leu 
20 25 30 

Ala Leu Asp Lys Leu Ala Lys Asn Pro ser Asn Pro Gin Leu Leu Ala 
35 40 45 

Glu Tyr Gin Ser Lys Leu ser Glu Tyr Thr Leu Tyr Arg Asn Ala Gin 
50 55 60 

Ser Asn Thr val Lys val lie Lys Asp val Asp Ala Ala Xle lie Gin 
65 70 75 80 

Asn Phe Arg 



<210> 8 
<211> 252 
<212> DNA 

<213> shigella flexneri 
<220> 

<223> nucleic acid encoding MxiH 

<223> sequence can be found at MedLine accession number Nc_004851 
<400> 8 

atgagtgtta cagtaccgaa tgatgattgg acattgagtt cattatctga aacttttgat 60 
gatggaactc aaacattaca aggtgaacta acattggcac tagataaatt agctaaaaat 120 
ccttcgaatc cacagttgct ggctgaatac caaagtaaat tatctgaata tacattatat 180 
aggaacgcgc aatccaatac agtgaaagtg attaaggatg ttgatgctgc aattattcaa 240 



aacttcagat aa 



<210> 9 

<211> 73 

<212> PRT 

<213> Escherichia coli 



252 



<220> 

<223> amino acid sequence of EscF 

<223> sequence can be found at MedLine accession number NP_312579.1 



Met Asn Leu ser Glu lie Thr Gin Gin Met Gly Glu val Gly Lys Thr 
15 10 15 

Leu ser Asp Ser val Pro Glu Leu Leu Asn ser Thr Asp Leu val Asn 
20 25 30 

Asp Pro Glu Lys Met Leu Glu Leu Gin Phe Ala val Gin Gin Tyr Ser 
35 40 45 

Ala Tyr val Asn val Glu Ser Gly Met Leu Lys Thr lie Lys Asp Leu 
50 55 60 

val ser Thr lie ser Asn Arg ser Phe 
65 70 

<210> 10 
<211> 222 
<212> 
<213> 

<220> 

<223> nucleic acid encoding EscF 

<223> sequence can be found at MedLine accession number NC_002695 
<4( > 0> 10 

atgaatttat ctgaaattac tcaacaaatg ggtgaagtag gtaaaacgct gagcgattct 60 
gtgccagagt tacttaatag caccgatttg gttaatgacc ctgaaaaaat gttagagttg 120 
cagtttgcgg ttcagcaata ttctgcttat gttaacgtag aaagtggaat gttgaaaacg 180 
ataaaagatc tggtctcaac catttctaac cgtagttttt aa 222 



<211> 240 

<212> DNA 

<213> Yersinia pestis 

<223> nucleic acid encoding ht-Yscf; an example of a recombinant YscF 
<220> 

<221> misc_feature 

<222> (214).. (237) 

<223> His-tag sequence 
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<400> 11 



atgagtaact tctctggatt tacgaaagga accgatgatg caaacaaagc ggttaatgac 60 

tcgatagcag cattgaaaga taagcctgac aacccggcgc tacttgctga cttacaacat 120 

tcaattaata aatggtcggt aatttacaat ataaactcaa ccatagttcg tagcatgaaa 180 

gacttaatgc aaggcatcct acagaagttc ccactcgagc accaccacca ccaccactga 240 



<210> 12 

<211> 95 

<212> PRT 

<213> Yersinia pestis 



<223> amino acid sequence of HT-YscF; an example of a recombinant YscF 
<220> 

<221> misc_feature 

<222> (88).. (95) 

<223> His-tag 

<400> 12 

Met ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp lie Ala Asp Leu 
15 10 15 



Asp Ala val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys 
20 25 30 



Ala val Asn Asp Ser lie Ala Ala Leu Lys Asp Lys Pro Asp Asn Pro 
35 40 45 



Ala Leu Leu Ala Asp Leu Gin His Ser He Asn Lys Trp Ser val He 
50 55 60 



Tyr Asn lie Asn ser Thr lie val Arg ser Met Lys Asp Leu Met Gin 
65 70 75 80 



Gly lie Leu Gin Lys Phe Pro Leu Glu His His His His His His 
85 90 95 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PMJ119 



<220> 
<221> 
<222> 
<223> 




kan encoded on complementary strand 



<220> 

<221> 



mi 5c_feature 
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<222> (3518).. (4597) 

<223> Laci encoded on complementary strand 
<220> 

<221> promoter 

<222> (49S3). .(5000) 

<223> T7 promoter 

<220> 

<221> misc_feature 

<222> (5001). .(5001) 

<223> T7 transcription start 

<220> 

<221> misc_feature 

<222> (5072). .(5104) 

<223> T7 Tag 

<220> 

<221> CDS 

<222> (5114). .(5401) 

<223> HT-YSCF 

<220> 

<221> misc_feature 

<222> (5381).. (5398) 

<223> His-tag sequence 

<220> 

<221> terminator 

<222> (5466).. (5512) 

<223> T7 terminator 



<400> 13 
tggcgaatgg 


gacgcgccct 


gtagcggcgc 


attaagcgcg 


gcgggtgtgg 


tggttacgcg 


60 


cagcgtgacc 


gctacacttg 


ccagcgccct 


agcgcccgct 


cctttcgctt 


tcttcccttc 


120 


ctttctcgcc 


acgttcgccg 


gctttccccg 


tcaagctcta 


aatcgggggc 


tccctttagg 


180 


gttccgattt 


agtgctttac 


ggcacctcga 


ccccaaaaaa 


cttgattagg 


gtgatggttc 


240 


acgtagtggg 


ccatcgccct 


gatagacggt 


ttttcgccct 


ttgacgttgg 


agtccacgtt 


300 


ctttaatagt 


ggactcttgt 


tccaaactgg 


aacaacactc 


aaccctatct 


cggtctattc 


360 


ttttgattta 


taagggattt 


tgccgatttc 


ggcctattgg 


ttaaaaaatg 


agctgattta 


420 


acaaaaattt 


aacgcgaatt 


ttaacaaaat 


attaacgttt 


acaatttcag 


gtggcacttt 


480 


tcggggaaat 


gtgcgcggaa 


cccctatttg 


tttatttttc 


taaatacatt 


caaatatgta 


540 


tccgctcatg 


aattaattct 


tagaaaaact 


catcgagcat 


caaatgaaac 


tgcaatttat 


600 


tcatatcagg 


attatcaata 


ccatattttt 


gaaaaagccg 


tttctgtaat 


gaaggagaaa 


660 


actcaccgag 


gcagttccat 


aggatggcaa 


gatcctggta 


tcggtctgcg 


attccgactc 


720 


gtccaacatc 


aatacaacct 


attaatttcc 


cctcgtcaaa 


aataaggtta 


tcaagtgaga 


780 


aatcaccatg 


agtgacgact 


gaatccggtg 


agaatggcaa 


aagtttatgc 


atttctttcc 


840 


agacttgttc 


aacaggccag 


ccattacgct 


cgtcatcaaa 


atcactcgca 


tcaaccaaac 


900 
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cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960 

aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020 

tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080 

tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140 

taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200 

ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260 

tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320 

tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380 

cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440 

cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500 

gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560 

gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620 

agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680 

aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740 

agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800 

cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860 

accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920 

aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980 

ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040 

cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100 

gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160 

tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220 

agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280 

tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340 

caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400 

ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460 

gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520 

gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580 

gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640 

aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700 

ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760 

acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820 



ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880 

tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940 

tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000 

cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060 

gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120 

ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180 

catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240 

ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300 

gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360 

gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420 

ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480 

atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540 

cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600 

tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660 

ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720 

aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780 

atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840 

cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900 

gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960 

tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020 

agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080 

gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140 

ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200 

catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260 

tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320 

tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380 

gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440 

ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500 

tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560 

catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620 

cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680 

tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740 
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ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800 

ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860 

cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920 

gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatctc gatcccgcga 4980 

aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040 

ttttgtttaa ctttaagaag gagatataca tatggctagc atgactggtg gacagcaaat 5100 

gggtcgggat ccg atg agt aac ttc tct gga ttt acg aaa gga acc gat 5149 
Met Ser Asn Phe ser Giy Phe Thr Lys Giy Thr Asp 
1 5 10 

ate gca gac tta gat gcg gtg get caa acg etc aag aag cca gca gac 5197 
lie Ala Asp Leu Asp Ala val Ala Gin Thr Leu Lys Lys Pro Ala Asp 
15 20 25 

gat gca aac aaa gcg gtt aat gac teg ata gca gca ttg aaa gat aag 5245 
Asp Ala Asn Lys Ala Val Asn Asp Ser lie Ala Ala Leu Lys Asp Lys 
30 35 40 

cct gac aac ccg gcg eta ctt get gac tta caa cat tea att aat aaa 5293 
Pro Asp Asn Pro Ala Leu Leu Ala Asp Leu Gin His ser lie Asn Lys 
45 50 55 60 

tgg teg gta att tac aat ata aac tea acc ata gtt cgt age atg aaa 5341 
Trp Ser val lie Tyr Asn lie Asn ser Thr lie val Arg Ser Met Lys 
65 70 75 

gac tta atg caa gqc ate eta cag aag ttc cca etc gag cac cac cac 5389 
Asp Leu Met Gin Giy lie Leu Gin Lys Phe Pro Leu Glu His His His 
80 85 90 

cac cac cac tga gatceggctg ctaacaaagc ccgaaaggaa gctgagttgg 5441 
His His His 
95 

ctgctgccac cgctgagcaa taactagcat aaccccttgg ggcctctaaa egggtcttga 5501 
ggggtttttt gctgaaagga ggaactatat ceggat 5537 



<210> 


14 


<211> 


5309 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


pET24b 


<220> 




<221> 


terminator 


<222> 


(26).. (72) 


<223> 


T7 terminator 


<220> 




<221> 


mi sc_f eature 


<222> 


(140).. (157) 


<223> 


His-tag sequence 


<220> 





<221> misc_feature 

<222> (206).. (238) 

<223> T7 Tag 

<220> 

<221> misc_feature 

<222> (309).. (309) 

<220> 

<221> miscfeature 

<222> (309).. (309) 

<223> T7 transcription start 

<220> 

<221> promoter 

<222> (310).. (327) 

<223> T7 promoter 

<220> 

<221> misc_feature 

<222> (713).. (716) 

<223> Alternate start codon "« 

<220> 

<221> CDS 

<222> (713). .(1792) 

<223> laci 

<220> 

<221> CDS 

<222> (3935). .(4747) 

<223> kan 



atccggatat 


agttcctcct 


ttcagcaaaa aacccctcaa 


gacccgttta 


gaggccccaa 


60 


ggggttatgc 


tagttattgc 


tcagcggtgg cagcagccaa 


ctcagcttcc 


tttcgggctt 


120 


tgttagcagc 


cggatctcag 


tggtggtggt ggtggtgctc 


gagtgcggcc 


gcaagcttgt 


180 


cgacggagct 


cgaattcgga 


tcccgaccca tttgctgtcc 


accagtcatg 


ctagccatat 


240 


gtatatctcc 


ttcttaaagt 


taaacaaaat tatttctaga 


ggggaattgt 


tatccgctca 


300 


caattcccct 


atagtgagtc 


gtattaattt cgcgggatcg 


agatctcgat 


cctctacgcc 


360 


ggacgcatcg 


tggccggcat 


caccggcgcc acaggtgcgg 


ttgctggcgc 


ctatatcgcc 


420 


gacatcaccg 


atggggaaga 


tcgggctcgc cacttcgggc 


tcatgagcgc 


ttgtttcggc 


480 


gtgggtatgg 


tggcaggccc 


cgtggccggg ggactgttgg 


gcgccatctc 


cttgcatgca 


540 


ccattccttg 


cggcggcggt 


gctcaacggc ctcaacctac 


tactgggctg 


cttcctaatg 


600 


caggagtcgc 


ataagggaga 


gcgtcgagat cccggacacc 


atcgaatggc 


gcaaaacctt 


660 


tcgcggtatg 


gcatgatagc 


gcccggaaga gagtcaattc 


agggtggtga 


at gtg aaa 
Met Lys 


718 



gta acg tta tac gat gtc gca gag tat gcc ggt gtc tct tat cag 
i val Thr Leu Tyr Asp val Ala Glu Tyr Ala Gly val ser Tyr Gin 



acc gtt tec cgc gtg gtg aac cag gec age cac gtt tct gcg aaa acg 
Thr val Ser Arg vat vat Asn Gin Ala Ser His val Ser Ala Lys Thr 
20 25 30 

egg gaa aaa gtg gaa gcg gcg atg gcg gag ctg aat tac att ccc aac 
Arq Glu Lys val Glu Ala Ala Met Ala Glu Leu Asn Tyr lie Pro Asn 
35 40 45 50 

cgc gtg gca caa caa ctg gcg ggc aaa cag teg ttg ctg att ggc gtt 
Arg val Ala Gin Gin Leu Ala Gly Lys Gin ser Leu Leu lie Gly val 
55 60 65 

gec acc tec agt ctg gee ctg cac gcg ccg teg caa att gtc gcg gcg 
Ala Thr ser Ser Leu Ala Leu His Ala Pro Ser Gin lie val Ala Ala 
70 75 SO 

att aaa tct cgc gec gat caa ctg ggt gee age gtg gtg gtg teg atg 
lie Lys Ser Arg Ala Asp Gin Leu Gly Ala Ser val Val val Ser Met 
85 90 95 

gta gaa cga age ggc gtc gaa gee tgt aaa gcg gcg gtg cac aat ctt 
val Glu Arg Ser Gly val Glu Ala Cys Lys Ala Ala val His Asn Leu 
100 105 110 

etc gcg caa cgc gtc agt ggg ctg ate att aac tat ccg ctg gat gac 
Leu Ala Gin Arg val Ser Gly Leu lie lie Asn Tyr Pro Leu Asp Asp 
115 120 125 130 

cag gat gee att get gtg gaa get gee tgc act aat gtt ccg gcg tta 
Gin Asp Ala lie Ala val Glu Ala Ala Cys Thr Asn val Pro Ala Leu 
135 140 145 

ttt ctt gat gtc tct gac cag aca ccc ate aac agt att att ttc tec 
Phe Leu Asp val ser Asp Gin Thr Pro lie Asn ser lie lie Phe Ser 
150 155 160 

cat gaa gac ggt acg cga ctg ggc gtg gag cat ctg gtc gca ttg ggt 
His Glu Asp Gly Thr Arg Leu Gly val Glu His Leu val Ala Leu Gly 



His Gin Gin lie Ala Leu Leu Ala Gly Pro Leu Ser ser val Ser Ala 
180 185 190 

cgt ctg cgt ctg get ggc tgg cat aaa tat etc act cgc aat caa att 
Arg Leu Arg Leu Ala Gly Trp His Lys Tyr Leu Thr Arg Asn Gin lie 
195 200 205 " 210 

cag ccg ata gcg gaa egg gaa ggc gac tgg agt gec atg tec ggt ttt 
Gin Pro lie Ala Glu Arg Glu Gly Asp Trp ser Ala Met Ser Gly Phe 
215 220 225 

caa caa acc atg caa atg ctg aat gag ggc ate gtt ccc act gcg atg 
Gin Gin Thr Met Gin Met Leu Asn Glu Gly lie val Pro Thr Ala Met 
230 235 240 

ctg gtt gee aac gat cag atg gcg ctg ggc gca atg cgc gee att acc 
Leu val Ala Asn Asp Gin Met Ala Leu Gly Ala Met Arg Ala lie Thr 
245 250 255 

gag tec ggg ctg cgc gtt ggt gcg gat ate teg gta gtg gga tac gac 
Glu Ser Gly Leu Arg val Gly Ala Asp lie Ser val val Gly Tyr Asp 
260 265 270 



gat acc gaa gac age tea tgt tat ate ccg ccg tta acc acc ate aaa 
Asp Thr Glu Asp Ser Ser Cys Tyr lie Pro Pro Leu Thr Thr lie Lys 
275 280 285 290 

cag gat ttt cgc ctg ctg ggg caa acc age gtg gac cgc ttg ctg caa 
Gin Asp Phe Arg Leu Leu Gly Gin Thr Ser Val Asp Arg Leu Leu Gin 
295 300 305 

etc tct cag ggc cag gcg gtg aag ggc aat cag ctg ttg ccc gtc tea 
Leu ser Gin Gly Gin Ala val Lys Gly Asn Gin Leu Leu Pro val Ser 
310 315 320 

ctg gtg aaa aga aaa acc acc ctg gcg ccc aat acg caa acc gee tct 
Leu val Lys Arg Lys Thr Thr Leu Ala Pro Asn Thr Gin Thr Ala ser 
325 330 335 

ccc cgc gcg ttg gec gat tea tta atg cag ctg gca cga cag gtt tec 
Pro Arq Ala Leu Ala Asp ser Leu Met Gin Leu Ala Arg Gin Val ser 
340 345 350 



cga ctg gaa age ggg cag tgagegcaac gcaattaatg taagttagct 1822 
Arg Leu Glu Ser Gly Gin 
355 360 

cactcattag geacegggat ctcgaccgat geccttgaga gccttcaacc cagtcagctc 1882 

cttccggtgg gegeggggea tgactatcgt cgccgcactt atgactgtct tctttatcat 1942 

geaactegta ggacaggtgc cggcagcgct ctgggtcatt tteggegagg accgctttcg 2002 

ctggagcgcg acgatgatcg gcctgtcgct tgeggtatte ggaatcttgc acgccctcgc 2062 

tcaagccttc gtcactggtc ccgccaccaa aegtttegge gagaagcagg ccattatcgc 2122 

cggcatggcg gccccacggg tgegcatgat cgtgctcctg tcgttgagga cccggctagg 2182 

ctggcggggt tgccttactg gttagcagaa tgaatcaccg atacgegage gaacgtgaag 2242 

cgactgctgc tgcaaaacgt ctgcgacctg agcaacaaca tgaatggtct tcggtttccg 2302 

tgtttcgtaa agtctggaaa cgcggaagtc agcgccctgc accattatgt teeggatctg 2362 

categcagga tgctgctggc taccctgtgg aacacctaca tctgtattaa egaagegctg 2422 

gcattgaccc tgagtgattt ttctctggtc ccgccgcatc cataccgcca gttgtttacc 2482 

ctcacaacgt tccagtaacc gggcatgttc atcatcagta acccgtatcg tgagcatcct 2542 

ctctcgtttc ateggtatea ttacccccat gaacagaaat cccccttaca eggaggcate 2602 

agtgaccaaa caggaaaaaa ccgcccttaa catggcccgc tttatcagaa gecagacatt 2662 

aacgettctg gagaaactca acgagctgga cgcggatgaa caggcagaca tctgtgaatc 2722 

gcttcacgac caegctgatg agctttaccg cagctgcctc gcgcgtttcg gtgatgaegg 2782 

tgaaaacctc tgacacatgc agctcccgga gaeggtcaca gcttgtctgt aagcggatgc 2842 

egggagcaga caagcccgtc agggegegtc agcgggtgtt ggcgggtgtc ggggegcage 2902 

catgacccag teaegtageg atageggagt gtatactggc ttaactatgc ggcatcagag 2962 

cagattgtac tgagagtgca ecatatatge ggtgtgaaat accgcacaga tgcgtaagga 3022 

13 



gaaaataccg 


catcaggcgc 


tcttccgctt 


cctcgctcac 


tgactcgctg 


cgctcggtcg 


3082 


ttcggctgcg 


gcgagcggta 


tcagctcact 


caaaggeggt 


aatacggtta 


tccacagaat 


3142 


caggggataa 


cgcaggaaag 


aacatgtgag 


caaaaggeca 


geaaaaggee 


aggaacegta 


3202 


aaaaggccgc 


gttgctggcg 


tttttccata 


ggctccgccc 


ccctgacgag 


catcacaaaa 


3262 


atcgacgctc 


aagtcagagg 


tggegaaace 


cgacaggact 


ataaagatac 


caggegttte 


3322 


cccctggaag 


ctccctcgtg 


cgctctcctg 


ttccgaccct 


gccgcttacc 


ggatacctgt 


3382 


ccgcctttct 


cccttcggga 


agcgtggcgc 


tttctcatag 


ctcacgctgt 


aggtatctca 


3442 


gttcggtgta 


ggtcgttcgc 


tccaagctgg 


gctgtgtgca 


cgaacccccc 


gttcagcccg 


3502 


accgctgcgc 


cttatccggt 


aactategtc 


ttgagtccaa 


cceggtaaga 


cacgacttat 


3562 


cgccactggc 


agcagccact 


ggtaacagga 


ttagcagagc 


gaggtatgta 


ggcggtgcta 


3622 


cagagttctt 


gaagtggtgg 


cctaactacg 


gctacactag 


aaggacagta 


tttggtatct 


3682 


gcgctctgct 


gaagccagtt 


acctteggaa 


aaagagttgg 


tagctcttga 


tccggcaaac 


3742 


aaaccaccgc 


tggtagcggt 


ggtttttttg 


tttgeaagea 


gcagattacg 


cgcagaaaaa 




aaggatctca 


agaagatcct 


ttgatctttt 


etaeggggtc 


tgacgctcag 


tggaacgaaa 


3862 


actcacgtta 


agggattttg 


gtcatgaaca 


ataaaactgt 


ctgettacat 


aaacagtaat 


3922 


acaaggggtg 


tt atg age 
Met Ser 


cat att caa egg gaa acg tct tgc ■ 
His lie Gin Arg Glu Thr Ser cys : 


tct agg ccg 
Ser Arg Pro 


3973 



cga tta aat tec aac atg gat get gat tta tat ggg tat aaa tgg get 
Arg Leu Asn ser Asn Met Asp Ala Asp Leu Tyr Gly Tyr Lys Trp Ala 
375 380 385 

cgc gat aat gtc ggg caa tea ggt gcg aca ate tat cga ttg tat ggg 
Arq Asp Asn val Gly Gin Ser Gly Ala Thr lie Tyr Arg Leu Tyr Gly 
390 395 400 405 

aag ccc gat gcg cca gag ttg ttt ctg aaa cat ggc aaa ggt age gtt 
Lys Pro Asp Ala Pro Glu Leu Phe Leu Lys His Gly Lys Gly Ser val 
410 415 420 

gee aat gat gtt aca gat gag atg gtc aga eta aac tgg ctg acg gaa 
Ala Asn Asp val Thr Asp Glu Met val Arg Leu Asn Trp Leu Thr Glu 
425 430 435 

ttt atg cct ctt ccg acc ate aag cat ttt ate cgt act cct gat gat 
Phe Met Pro Leu Pro Thr lie Lys His Phe lie Arg Thr Pro Asp Asp 
440 445 450 

gca tgg tta etc acc act gcg ate ccc ggg aaa aca gca ttc cag gta 
Ala Trp Leu Leu Thr Thr Ala lie Pro Gly Lys Thr Ala Phe Gin val 
455 460 465 

tta gaa gaa tat cct gat tea ggt gaa aat att gtt gat gcg ctg gca 
Leu Glu Glu Tyr Pro Asp Ser Gly Glu Asn lie Val Asp Ala Leu Ala 
470 ' 475 480 485 

gtg ttc ctg cgc egg ttg cat teg att cct gtt tgt aat tgt cct ttt 

val Phe Leu Arg Arg Leu His Ser lie Pro val Cys Asn Cys Pro Phe 



490 495 500 

aac age gat cgc gta ttt cgt etc get cag gcg caa tea cga atg aat 
Asn Ser Asp Arg val Phe Arg Leu Ala Gin Ala Gin ser Arg Met Asn 
505 510 515 

aac ggt ttg gtt gat gcg agt gat ttt gat gac gag cgt aat gqc tgg 
Asn Gly Leu val Asp Ala Ser Asp Phe Asp Asp Glu Arg Asn Gly Trp 
520 525 530 

cct gtt gaa caa gtc tgg aaa gaa atg cat aaa ctt ttg cca ttc tea 
Pro val Glu Gin val Trp Lys Glu Met His Lys Leu Leu Pro Phe Ser 
535 540 545 

ccg gat tea gtc gtc act cat ggt gat ttc tea ctt gat aac ctt att 
pro Asp ser val val Thr His Gly Asp Phe ser Leu Asp Asn Leu lie 
550 555 560 565 

ttt gac gag ggg aaa tta ata ggt tgt att gat gtt gga cga gtc gga 
Phe Asp Glu Gly Lys Leu lie Gly Cys lie Asp Val Gly Arg val Gly 
570 575 580 

ate gca gac cga tac cag gat ctt gec ate eta tgg aac tgc etc ggt 
lie Ala Asp Arg Tyr Gin Asp Leu Ala lie Leu Trp Asn Cys Leu Gly 
585 590 595 

gag ttt tct cct tea tta cag aaa egg ctt ttt caa aaa tat ggt att 
Glu Phe ser Pro ser Leu Gin Lys Arg Leu Phe Gin Lys Tyr Gly lie 
600 605 610 

gat aat cct gat atg aat aaa ttg cag ttt cat ttg atg etc gat gag 
Asp Asn Pro Asp Met Asn Lys Leu Gin Phe His Leu Met Leu Asp Glu 
615 620 625 



ttt ttc taagaattaa ttcatgagcg gatacatatt tgaatgtatt tagaaaaata 

Phe Phe 

630 


4797 


aacaaatagg 


ggttccgcgc 


acatttcccc 


gaaaagtgcc acctgaaatt 


gtaaacgtta 


4857 


atattttgtt 


aaaattcgcg 


ttaaattttt 


gttaaatcag 


ctcatttttt 


aaccaatagg 


4917 


ccgaaategg 


caaaatccct 


tataaatcaa 


aagaatagac 


cgagataggg 


ttgagtgttg 


4977 


ttccagtttg 


gaacaagagt 


ccactattaa 


agaacgtgga 


ctccaacgtc 


aaagggcgaa 


5037 


aaacegtcta 


teagggegat 


ggcccactac 


gtgaaccatc 


accctaatca 


agttttttgg 


5097 


ggtcgaggtg 


cegtaaagea 


etaaategga 


accctaaagg 


gagcccccga 


tttagagctt 


5157 


gaeggggaaa 


gccggcgaac 


gtggcgagaa 


aggaagggaa 


gaaagcgaaa 


ggagcgggcg 


5217 


etagggeget 


ggcaagtgta 


gcggtcacgc 


tgegegtaac 


caccacaccc 


gccgcgctta 


5277 


atgcgccgct 


acagggegeg 


tcccattcgc 


ca 






5309 



<210> 15 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer, HT-YscF Start 



<400> 15 

cgggatccga tgagtaactt ctctggattt 



30 



<210> 16 

<2U> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer, HT-YscF Stop 

<400> 16 

ccgctcgagt gggaacttct gtaggatgcc 

<210> 17 

<211> 87 

<212> PRT 

<213> Yersinia pestis KIM5 
<220> 

<223> YscF amino acid seuquence from Y. pestis KIMS 

<223> sequence can be found at MedLine accession number NC_004839 



Met Ser Asn Phe ser Gly Phe Thr Lys Gly Thr Asp lie Ala Asp Leu 
15 10 15 

Asp Ala val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys 
20 25 30 

Ala val Asn Asp ser lie Ala Ala Leu Lys Asp Lys Pro Asp Asn pro 
35 40 45 

Ala Leu Leu Ala Asp Leu Gin His Ser lie Asn Lys Trp ser val He 
50 55 60 

Tvr Asn lie Asn ser Thr lie val Arg Ser Met Lys Asp Leu Met Gin 
65 70 75 80 

Gly lie Leu Gin Lys Phe Pro 
85 

<210> 18 
<211> 87 
<212> PRT 

<213> Yersinia pestis C092 
<220> 

<223> YscF amino acid sequence from Y. pestis C092 

<223> sequence can be found at MedLine accession number NC_003131 

16 



Met ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp He Ala Asp Leu 
15 10 15 

Asp Ala val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys 
20 25 30 

Ala val Asn Asp ser lie Ala Ala Leu Lys Asp Lys Pro Asp Asn Pro 
35 40 45 

Ala Leu Leu Ala Asp Leu Gin His Ser lie Asn Lys Trp ser val He 
50 55 60 

Tyr Asn He Asn ser Thr He val Arg Ser Met Lys Asp Leu Met Gin 
65 70 75 80 

Gly lie Leu Gin Lys Phe Pro 

<210> 19 

<211> 87 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Consensus sequence for YscF 



Met ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp lie Ala Asp Leu 
15 10 15 

asp Ala val Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys 
20 25 30 

Ala val Asn Asp Ser lie Ala Ala Leu Lys Asp Lys Pro Asp Asn Pro 
35 40 45 

Ala Leu Leu Ala Asp Leu Gin His Ser He Asn Lys Trp ser val lie 
50 55 60 

Tyr Asn lie Asn Ser Thr lie val Arg ser Met Lys Asp Leu Met Gin 
65 70 75 80 

Gly lie Leu Gin Lys Phe Pro 



<210> 20 
<211> 4884 

<212> DNA 



<213> Artificial Sequence 



<220> 

<223> pBADl8-YSCF 



<220> 

<221> misc_feature 

<222> (96) , . (974) 

<223> arac 

<220> 

<221> mi sc_f eatu re 

<222> (1003) (1020) 

<223> operator 02 

<220> 

<221> promoter 

<222> (1125).. (1153) 

<223> PC promoter 

<220> 

<221> misc_feature 

<222> (1161) . . (1182) 

<223> operator 01 

<220> 

<221> misc_feature 

<222> (1204). .(1217) 

<223> CAP site 

<220> 

<221> misc_feature 

<222> (1213).. (1251) 

<223> operator 12 + il 

<220> 

<221> promoter 

<222> (1250).. (1277) 

<223> PBAD promoter 

<220> 

<221> CDS 

<222> (1323).. (1586) 

<223> YscF 

<220> 

<221> misc_feature 

<222> (1634).. (2059) 

<223> rrnB 

<220> 

<221> terminator 

<222> (1634). .(2059) 

<223> rrnB 

<220> 

<221> promoter 

<222> (2114).. (2120) 

<223> bla p3 promoter 

<220> 

<221> miscfeature 

<222> (2152).. (3015) 



<223> bla 
<220> 

<221> rep_origin 

<222> (3051).. (3509) 

<223> M13 origin 



<220> 

<221> rep„origin 

<222> (3515).. (4212) 

<223> pBR322 origin 



<400> 20 
atcgatgcat 


aatgtgcctg 


tcaaatggac 


gaagcaggga 


ttctgeaaac 


cctatgctac 


60 


tccgtcaagc 


cgtcaattgt 


ctgattcgtt 


accaattatg 


acaacttgac 


ggctacatca 


120 


ttcacttttt 


cttcacaacc 


ggcaeggaac 


tcgctcgggc 


tggccccggt 


gcatttttta 


180 


aatacccgcg 


agaaatagag 


ttgategtea 


aaaccaacat 


tgcgaccgac 


ggtggcgata 


240 


ggcatccggg 


tggtgctcaa 


aagcagcttc 


gcctggctga 


tacgttggtc 


ctcgcgccag 


300 


cttaagacgc 


taatccctaa 


ctgctggcgg 


aaaagatgtg 


acagacgega 


eggegacaag 


360 


caaacatgct 


gtgegacget 


ggegatatea 


aaattgctgt 


ctgccaggtg 


ategctgatg 


420 


tactgacaag 


cctcgcgtac 


ccgattatcc 


atcggtggat 


ggagegaetc 


gttaatcget 


480 


tccatgcgcc 


gcagtaacaa 


ttgetcaage 


agatttatcg 


ccagcagctc 


egaatagege 


540 


ccttcccctt 


gcccggcgtt 


aatgatttgc 


ccaaacaggt 


cgctgaaatg 


cggctggtgc 


600 


gcttcatccg 


ggcgaaagaa 


ccccgtattg 


gcaaatattg 


aeggecagtt 


aagecattea 


660 


tgccagtagg 


cgcgcggacg 


aaagtaaacc 


cactggtgat 


accattcgcg 


agcctccgga 


720 


tgacgaccgt 


agtgatgaat 


ctctcctggc 


gggaacagca 


aaatatcacc 


eggteggcaa 


780 


acaaattctc 


gtccctgatt 


tttcaccacc 


ccctgaccgc 


gaatggtgag 


attgagaata 


840 


taacctttca 


ttcccagcgg 


teggtcgata 


aaaaaatcga 


gataacegtt 


ggcctcaatc 


900 


ggcgttaaac 


ccgccaccag 


atgggcatta 


aacgagtatc 


ccggcagcag 


gggatcattt 


960 


tgcgcttcag 


ccatactttt 


catactcccg 


ccattcagag 


aagaaaccaa 


ttgtccatat 


1020 


tgcatcagac 


attgeegtea 


ctgcgtcttt 


tactggctct 


tctcgctaac 


caaaceggta 


1080 


accccgctta 


ttaaaagcat 


tctgtaacaa 


agegggacca 


aagecatgae 


aaaaacgcgt 


1140 


aacaaaagtg 


tctataatca 


eggcagaaaa 


gtccacattg 


attatttgea 


cggcgtcaca 


1200 


ctttgctatg 


ccatagcatt 


tttatccata 


agattagegg 


atcctacctg 


aegcttttta 


1260 


tcgcaactct 


ctactgtttc 


tccatacccg 


tttttttggg 


etagegaatt 


caggaggaaa 


1320 


eg atg agt 
Met ser 
1 


aac ttc tct gga ttt acg aaa gga acc gat ate 
Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp lie 
5 10 


gca gac 
Ala Asp 
15 


1367 



tta gat gcg gtg get caa acg etc aag aag cca gca gac gat gca aac 1415 
Leu Asp Ala vat Ala Gin Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn 
20 25 30 

19 



aaa gcg gtt aat gac teg ata gca gca ttg aaa gat aag cct gac aac 1463 
Lys Ala val Asn Asp ser lie Ala Ala Leu Lys Asp Lys Pro Asp Asn 
35 40 45 

ccg gcg eta ctt get gac tta caa cat tea att aat aaa tgg teg gta. 1511 
Pro Ala Leu Leu Ala Asp Leu Gin His ser lie Asn Lys Trp Ser val 
50 55 60 

att tac aat ata aac tea acc ata gtt cgt age atg aaa gac tta atg 1559 
lie Tyr Asn lie Asn ser Thr lie val Arg ser Met Lys Asp Leu Met 
65 70 75 

caa ggc ate eta cag aag ttc cca taa ggatcccgcg gggatcctct 1606 
Gin Gly lie Leu Gin Lys Phe Pro 
80 85 

agagtcgacc tgcaggcatg caagcttggc tgttttggcg gatgagagaa gattttcagc 1666 

ctgatacaga ttaaatcaga aegcagaage ggtctgataa aacagaattt gcctggcggc 1726 

agtagegegg tggtcccacc tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc 1786 

gatggtagtg tggggtctcc ecatgegaga gtagggaact gccaggcatc aaataaaacg 1846 

aaaggctcag tcgaaagact gggecttteg ttttatctgt tgtttgtcgg tgaaegctet 1906 

cctgagtagg acaaatccgc egggagegga tttgaacgtt gcgaagcaac ggeceggagg 1966 

gtggcgggca ggacgcccgc cataaactgc caggcatcaa attaagcaga aggccatcct 2026 

gaeggatgge etttttgegt ttctacaaac tcttttgttt atttttctaa atacattcaa 2086 

atatgtatcc gctcatgaga caataaccct gataaatget tcaataatat tgaaaaagga 2146 

agagtatgag tattcaacat ttccgtgtcg cccttattcc ettttttgeg geattttgee 2206 

ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgetgaa gatcagttgg 2266 

gtgcacgagt gggttacatc gaactggatc teaacagegg taagatcctt gagagttttc 2326 

gccccgaaga aegttttcca atgatgagca cttttaaagt tetgetatgt ggegeggtat 2386 

tatcccgtgt tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 2446 

acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 2506 

aattatgcag tgetgecata accatgagtg ataacactgc ggecaactta cttctgacaa 2566 

egateggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 2626 

gecttgateg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 2686 

cgatgcctgc agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 2746 

tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 2806 

tgcgctcggc ccttccggct ggctggttta ttgctgataa atetggagee ggtgagcgtg 2866 

ggtctcgegg tatcattgea gcactggggc cagatggtaa gccctcccgt ategtagtta 2926 

tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 2986 

gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 3046 



ttgatttacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc 3106 

gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt 3166 

ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc 3226 

cgatttagtg ctttacggca cctcgacccc aaaaaacttg atttgggtga tggttcacgt 3286 

agtgggccat cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt 3346 

aatagtggac tcttgttcca aacttgaaca acactcaacc ctatctcggg ctattctttt 3406 

gatttataag ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa 3466 

aaatttaacg cgaattttaa caaaatatta acgtttacaa tttaaaagga tctaggtgaa 3526 

gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc 3586 

gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat 3646 

ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga 3706 

gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt 3766 

ccttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata 3826 

cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac 3886 

cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg 3946 

ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg 4006 

tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag 4066 

cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct 4126 

ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc 4186 

aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt 4246 

ttgctggcct tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg 4306 

tattaccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga 4366 

gtcagtgagc gaggaagcgg aagagcgcct gatgcggtat tttctcctta cgcatctgtg 4426 

cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt 4486 

aagccagtat acactccgct atcgctacgt gactgggtca tggctgcgcc ccgacacccg 4546 

ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa 4606 

gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc 4666 

gcgaggcagc aaggagatgg cgcccaacag tcccccggcc acggggcctg ccaccatacc 4726 

cacgccgaaa caagcgctca tgagcccgaa gtggcgagcc cgatcttccc catcggtgat 4786 

gtcggcgata taggcgccag caaccgcacc tgtggcgccg gtgatgccgg ccacgatgcg 4846 

tccggcgtag aggatctgct catgtttgac agcttatc 4884 
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